Search CORE

98 research outputs found

Recommended from our members

Health Effects Associated With Electronic Cigarette Use: Automated Mining of Online Forums.

Author: Hristidis Vagelis
Hua My
Sadah Shouq
Talbot Prue
Publication venue: eScholarship, University of California
Publication date: 01/01/2020
Field of study

BACKGROUND:Our previous infodemiological study was performed by manually mining health-effect data associated with electronic cigarettes (ECs) from online forums. Manual mining is time consuming and limits the number of posts that can be retrieved. OBJECTIVE:Our goal in this study was to automatically extract and analyze a large number (>41,000) of online forum posts related to the health effects associated with EC use between 2008 and 2015. METHODS:Data were annotated with medical concepts from the Unified Medical Language System using a modified version of the MetaMap tool. Of over 1.4 million posts, 41,216 were used to analyze symptoms (undiagnosed conditions) and disorders (physician-diagnosed terminology) associated with EC use. For each post, sentiment (positive, negative, and neutral) was also assigned. RESULTS:Symptom and disorder data were categorized into 12 organ systems or anatomical regions. Most posts on symptoms and disorders contained negative sentiment, and affected systems were similar across all years. Health effects were reported most often in the neurological, mouth and throat, and respiratory systems. The most frequently reported symptoms and disorders were headache (n=939), coughing (n=852), malaise (n=468), asthma (n=916), dehydration (n=803), and pharyngitis (n=565). In addition, users often reported linked symptoms (eg, coughing and headache). CONCLUSIONS:Online forums are a valuable repository of data that can be used to identify positive and negative health effects associated with EC use. By automating extraction of online information, we obtained more data than in our prior study, identified new symptoms and disorders associated with EC use, determined which systems are most frequently adversely affected, identified specific symptoms and disorders most commonly reported, and tracked health effects over 7 years

eScholarship - University of California

Relevance-based Retrieval on Hidden-Web Text Databases without Ranking Support

Author: Hristidis Vagelis
Hu Yuheng
Ipeirotis Panagiotis G.
Publication venue
Publication date: 21/09/2009
Field of study

Many online or local data sources provide powerful querying mechanisms but limited ranking capabilities. For instance, PubMed allows users to submit highly expressive Boolean keyword queries, but ranks the query results by date only. However, a user would typically prefer a ranking by relevance, measured by an Information Retrieval (IR) ranking function. The naive approach would be to submit a disjunctive query with all query keywords, retrieve the returned documents, and then re-rank them. Unfortunately, such an operation would be very expensive due to the large number of results returned by disjunctive queries. In this paper we present algorithms that return the top results for a query, ranked according to an IR-style ranking function, while operating on top of a source with a Boolean query interface with no ranking capabilities (or a ranking capability of no interest to the end user). The algorithms generate a series of conjunctive queries that return only documents that are candidates for being highly ranked according to a relevance metric. Our approach can also be applied to other settings where the ranking is monotonic on a set of factors (query keywords in IR) and the source query interface is a Boolean expression of these factors. Our comprehensive experimental evaluation on the PubMed database and a TREC dataset show that we achieve order of magnitude improvement compared to the current baseline approaches.Vagelis Hristidis was partly supported by NSF grant IIS-0811922 and DHS grant 2009-ST-062-000016. Panagiotis G.\ Ipeirotis was supported by the National Science Foundation under Grant No. IIS-0643846

New York University Faculty Digital Archive

Using Social Media to Explore Mental Health-Related Behaviors and Discussions among Young Adults

Author: Garett Renee
Hristidis Vagelis
Shahbazi Moloud
Young Sean
Publication venue: AIS Electronic Library (AISeL)
Publication date: 01/01/2019
Field of study

Abstract There have been recurring reports of online harassment and abuse among adolescents and young adults through Anonymous Social Networking websites (ASNs). We explored discussions related to social and mental health behaviors among college students, including cyberbullying on the popular ASN, Yik Yak. From April 6, 2016, to May 7, 2016, we collected anonymous conversations posted on Yik Yak at 19 universities in four different states. We found that prosocial messages were approximately five times as prevalent as bullying messages. Frequency of cyberbullying messages was positively associated with messages seeking emotional help. We found significant geographic variation in the frequency of messages offering supportive versus bullying messages. Across campuses bullying and political discussion were positively associated. Results suggest that ASN sites can be mined for real-time data about students’ mental health-related attitudes and behaviors. We discuss the implications for using this information in education and healthcare services

Crossref

ScholarSpace at University of Hawai'i at Manoa

AIS Electronic Library (AISeL)

BORG: Block-reORGanization and Self-optimization in Storage Systems

Author: Bhadkamkar Medha
Burnett Sam
Guerra Jorge
Hristidis Vagelis
Liptak Jason
Rangaswami Raju
Useche Luis
Publication venue: SURFACE at Syracuse University
Publication date: 01/01/2007
Field of study

This paper presents the design, implementation, and evaluation of BORG, a self-optimizing storage system that performs automatic block reorganization based on the observed I/O workload. BORG is motivated by three characteristics of I/O workloads: non-uniform access frequency distribution, temporal locality, and partial determinism in non-sequential accesses. To achieve its objective, BORG manages a small, dedicated partition on the disk drive, with the goal of servicing a majority of the I/O requests from within this partition with significantly reduced seek and rotational delays. BORG is transparent to the rest of the storage stack, including applications, file system(s), and I/O schedulers, thereby requiring no or minimal modification to storage stack implementations. We evaluated a Linux implementation of BORG using several real-world workloads, including individual user desktop environments, a web-server, a virtual machine monitor, and an SVN server. These experiments comprehensively demonstrate BORG’s effectiveness in improving I/O performance and its incurred resource overhead

Syracuse University Research Facility and Collaborative Environment

User-centric Music Information Retrieval

Author: Bo Shao
Bo Shao
Dean Amir Mirmiran
Major Professor
Mohammed Hadi
Tao Li
Vagelis Hristidis
Publication venue: FIU Digital Commons
Publication date: 01/01/2011
Field of study

The rapid growth of the Internet and the advancements of the Web technologies have made it possible for users to have access to large amounts of on-line music data, including music acoustic signals, lyrics, style/mood labels, and user-assigned tags. The progress has made music listening more fun, but has raised an issue of how to organize this data, and more generally, how computer programs can assist users in their music experience. An important subject in computer-aided music listening is music retrieval, i.e., the issue of efficiently helping users in locating the music they are looking for. Traditionally, songs were organized in a hierarchical structure such as genre-\u3eartist-\u3ealbum-\u3etrack, to facilitate the users’ navigation. However, the intentions of the users are often hard to be captured in such a simply organized structure. The users may want to listen to music of a particular mood, style or topic; and/or any songs similar to some given music samples. This motivated us to work on user-centric music retrieval system to improve users’ satisfaction with the system. The traditional music information retrieval research was mainly concerned with classification, clustering, identification, and similarity search of acoustic data of music by way of feature extraction algorithms and machine learning techniques. More recently the music information retrieval research has focused on utilizing other types of data, such as lyrics, user access patterns, and user-defined tags, and on targeting non-genre categories for classification, such as mood labels and styles. This dissertation focused on investigating and developing effective data mining techniques for (1) organizing and annotating music data with styles, moods and user-assigned tags; (2) performing effective analysis of music data with features from diverse information sources; and (3) recommending music songs to the users utilizing both content features and user access patterns

CiteSeerX

DigitalCommons@Florida International University

Information Discovery on Electronic Health Records Using Authority Flow Techniques

Author: A Balmin
A Singhal
A Singhal
AK Sehgal
CJ McDonald
DL Shepelyansky
F Farfán
H Hwang
J Savoy
JF Fontaine
L Guo
M Brinkmeier
MG Weiner
MI Lieberman
Michael Weiner
Paul Biondich
R Moskovitch
R Motwani
R Varadarajan
Ramakrishna R Varadarajan
RM Podowski
S Agrawal
S Brin
SE Robertson
SE Robertson
T Haveliwala
T Matsunaga
V Hristidis
V Hristidis
Vagelis Hristidis
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Abstract Background As the use of electronic health records (EHRs) becomes more widespread, so does the need to search and provide effective information discovery within them. Querying by keyword has emerged as one of the most effective paradigms for searching. Most work in this area is based on traditional Information Retrieval (IR) techniques, where each document is compared individually against the query. We compare the effectiveness of two fundamentally different techniques for keyword search of EHRs. Methods We built two ranking systems. The traditional BM25 system exploits the EHRs' content without regard to association among entities within. The Clinical ObjectRank (CO) system exploits the entities' associations in EHRs using an authority-flow algorithm to discover the most relevant entities. BM25 and CO were deployed on an EHR dataset of the cardiovascular division of Miami Children's Hospital. Using sequences of keywords as queries, sensitivity and specificity were measured by two physicians for a set of 11 queries related to congenital cardiac disease. Results Our pilot evaluation showed that CO outperforms BM25 in terms of sensitivity (65% vs. 38%) by 71% on average, while maintaining the specificity (64% vs. 61%). The evaluation was done by two physicians. Conclusions Authority-flow techniques can greatly improve the detection of relevant information in EHRs and hence deserve further study.</p

Crossref

IUPUIScholarWorks

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

DigitalCommons@Florida International University

Predicting the Effectiveness of Keyword Queries on Databases

Author: Cheng Shiwen
Hristidis Vagelis
Termehchy Arash
Publication venue
Publication date: 01/01/2012
Field of study

Keyword query interfaces (KQIs) for databases provide easy access to data, but often su er from low ranking quality, i.e. low precision and/or recall, as shown in recent bench- marks. It would be useful to be able to identify queries that are likely to have low ranking quality to improve the user satisfaction. For instance, the system may suggest to the user alternative queries for such hard queries. In this paper, we analyze the characteristics of hard queries and propose a novel framework to measure the degree of di culty for a key- word query over a database, considering both the structure and the content of the database and the query results. We devise e cient algorithms to compute the degree of di culty at query-time, and show that the overhead is very small com- pared to the query execution time. We evaluate our query di culty prediction model against two relevance judgment benchmarks for keyword search on databases, INEX and SemSearch. Our study shows that our model predicts the hard queries with high accuracy.published or submitted for publicationnot peer reviewe

Crossref

Illinois Digital Environment for Access to Learning and Scholarship Repository